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ABSTRACT 

Visualization is a powerful paradigm for exploratory data analysis. 
Visualizing large graphs, however, often results in a meaningless 
hairball. In this paper, we propose a different approach that helps 
the user adaptively explore large million-node graphs from a lo¬ 
cal perspective. For nodes that the user investigates, we propose 
to only show the neighbors with the most subjectively interesting 
neighborhoods. We contribute novel ideas to measure this inter¬ 
estingness in terms of how surprising a neighborhood is given the 
background distribution, as well as how well it fits the nodes the 
user chose to explore. 

We introduce Facets, a fast and scalable method for visually 
exploring large graphs. By implementing our above ideas, it allows 
users to look into the forest through its trees. Empirical evaluation 
shows that our method works very well in practice, providing rank¬ 
ings of nodes that match interests of users. Moreover, as it scales 
linearly, FACETS is suited for the exploration of very large graphs. 

Categories and Subject Descriptors 

H. 2.8 [Databasemanagement]: Database applications-Data mining 
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I. INTRODUCTION 

Large graphs are ubiquitous. They are natural representations 
for many domains, and hence we find graph structured data every¬ 
where. As data collection becomes increasingly simple, and many 
domains remain complex, real-world graphs are growing extremely 
large and rich with data. These graphs may have thousands or more 
of attributes and scale up to and beyond millions of nodes and bil¬ 
lions of edges. It is fair to say that many graphs are in fact too big; 
exploring such large graphs, where the goal of the user is to gain 
understanding, is a highly non-trivial task. 

Visualization is perhaps the most natural approach to exploratory 
data analysis. Under the right visualization, finding patterns, decid- 



Big Mommas House 2 

Congeniality 

r- . . Pretty Woman 

My Fair Lady •— 

Cinderella / s^\.^Oceans Eleven 

Maid in Manhattan’" 

Oceans Twelve < 


Mommas 


House 


Catch Me If You Can \ 


The Italian Job 


(a) Hairball 


(b) Facets reveals insights 


Figure 1: (a) The Rotten Tomatoes movie graph shown using 
conventional spring layout (an edge connects two movie nodes if 
some users voted them as similar). Even for this relatively small 
graph of 17k nodes and 72k edges, a global visualization does 
not provide much insight, (b) A better way, using our Facets 
approach, focuses on movies that are the most subjectively in¬ 
teresting, surprising, or both. For example, FACETS suggests 
Pretty Woman (romantic-comedy) as a interesting, suprising re¬ 
lated movie of Miss Congeniality (crime-comedy). 


ing what is interesting, what is not, and what to investigate next are 
all easy tasks - in a sense the answers ‘jump to us’ as our brains are 
highly specialized for analyzing complex visual data. It is there¬ 
fore no surprise that Shneiderman’s mantra of “overview, zoom & 
filter, details-on-demand” has proven to be successful in many 
domains 

Visualizing large graphs in an intuitive and informative manner 
has proven to be rather difficult. Even using advanced layout tech¬ 
niques plotting a graph typically leads to a useless “hairball” from 
which nothing can be deduced fTSlpA) . This is even the case when 
we plot graphs with only thousands of nodes (see Eigure [^a) for 
an example). Instead of plotting the whole graph, visualizing only 
part of the graph seems more promising Doing this naively 

leads to the same problem. Because real world graphs are often 
scale free (follow a power law degree distribution j^), even a single 
hop expansion from a selected node can be visually overwhelming. 

We take a different approach. We propose to adaptively explore 
large graphs from a local perspective. That is, starting from an 
initially selected node - e.g., explicitly queried by the user, or pro¬ 
posed by an outlier detection algorithm Q - we propose to only 
show the most interesting neighbors. We identify these by their 
subjective interestingness, by how surprising their neighborhood 
distribution is (e.g., do neighbors’ degree distributions follow a 
power law like when considering all nodes?), as well as by how 
similar this distribution is to those of the nodes the user already 
visited during the exploration. By only showing those parts of 
the graph that are most informative to the user, we keep the view 



clean. By being adaptive, Facets allows every user to explore 
those facets of the graph that are most interesting to them. 

We call our adaptive approach Facets — our idea is a signifi¬ 
cant addition to existing works that aim to recommend individual 
nodes to users; instead, we steer users towards local regions of the 
graphs that match best with their current browsing interests, helping 
them better understand and visualize the graph at the same time. 

To illustrate how Facets works in practice, consider our user 
Susan who is looking for interesting movies to watch (see Figure[^, 
by exploring a Rotten Tomatoes movie similarity graph with 17k 
movies. In this graph, an edge connects two movie nodes if users 
of RottenTomatoes voted them as similar films. Susan has watched 
Miss Congenialitj^ a crime-comedy that stars Sandra Bullock as 
an FBI agent who thwarts terrorist efforts by going undercover, 
turning her rude unflattering self into a glamorous beauty queen 
(see Figure[^). Facets simultaneously suggest a few movies that 
are interesting and surprising to her. 

Matching Susan’s interest. Facets suggests the Big Mommas 
House series, which also have undercover plots and are interest¬ 
ingly like Miss Congeniality. They both share low critics scores, 
but high audience scores (i.e., most critics do not like them, but 
people love them). To Susan’s surprise. Facets suggests Pretty 
Woman, which is quite different (thus surprising) — a romantic- 
comedy that has both scores from the critics and the audience. But, 
there is still more subtle similarity (thus still drawing Susan’s in¬ 
terest); both films share a Cinderalla-like storyline, which explains 
why the two movies are connected in the graph: Sandra Bullock 
goes from a rude agent to a beauty queen; in Pretty Woman, Julia 
Roberts goes from a prostitute to a fair lady. In fact. Pretty Woman 
is a classic, exemplar romantic-comedy; many movies follow sim¬ 
ilar story lines (e.g.. Maid in Manhattan). Thus, Pretty Woman has 
very high degree in the graph, unlike Miss Congeniality which is a 
niche genre; this also contributes to Pretty Woman's surprisingness. 

Through Pretty Woman, FACETS again pleasantly surprises Su¬ 
san with Oceans Eleven, which also stars Julia Roberts, and is in a 
rather different light hearted crime or heist genre, introducing Su¬ 
san to other very similar movies like Oceans Twelve and The Italian 
Job. Figure[2b) summarizes Susan’s exploration. If Susan were to 
use a conventional visualization tool to perform the same kind of 
movie exploration, she would likely be completely overwhelmed 
with an incomprehensible hairball visualization (as in Figure [^a)). 

The key contributions of our work include: 

• A framework for locally exploring a graph without clutter, 
showing only the most subjectively most interesting nodes, 
and hence being adaptive to the users’ interests. 

• A formal notion of subjective interestingness for graph ex¬ 
ploration taking both divergence between local and global 
distributions, and similarity to explored nodes into account. 

• A measure of surprise over neighborhoods - rather than local 
node attributes - to draw users in the direction of graph areas 
with subjectively interesting content. 

• A highly scalable method. Facets, for adaptively exploring 
very large graphs in a visual environment. Experimental ev¬ 
idence demonstrates the effectiveness of Facets. 

The rest of the paper is organized as follows. First, in Sec. we 
formalize the problem, introduces our notions of interestingness, 
and propose our Facets solution. Then, in Sec. we present our 
ideas as an integrated approach, with visualization and algorithms 

^One of the most frequently rated movies on Netflix 


working closely together. We empirically evaluate in Sec. We 
discuss related work in Sec.[^ and end with conclusions in Sec.[^ 

2. FACETS: 

ADAPTIVE GRAPH EXPLORATION 

In this section, we formalize the problems we aim to solve through 
our Facets approach to achieve adaptive exploration. Then, we 
describe our main ideas, and describe our solutions. 

To enhance readability, we have listed the symbols used in this 
paper in Table The reader may want to return to this table for 
technical terms meanings used in various contexts of discussion. 


Symbol 

Description 

Vi 

Node i 

Djs 

Jensen-Shannon Divergence 

Dkl 

Kullback-Leibler Divergence 

Si 

surprise-score for node Vi 

Ti 

interest-score for node Vi 

Sa 

surprise scores for all neighbors of Va 

Ra 

interest scores for all neighbors of Va 

fj 

a j-th feature for nodes 

Ai 

weight of feature fj 


neighborhood dist. of node Vi for feature fj 

Gj 

global dist. for feature fj 

Uj 

user profile dist. for feature fj 


Table 1: Symbols & Notation 


2.1 Problem Definition 

The input is a graph G = {V,E, A) where A is a set of numer¬ 
ical or categorical attributes. Each node Vi C V has a correspond¬ 
ing attribute value for each attribute (feature) fj G A (e.g., degree). 
Our approach works with both categorical and numerical attributes. 
We assume there are no self-edges. 

We solve the following problem with Eacets : 

Definition 1. Node-Sequence Aware Ranking. Given a start¬ 
ing node Ua, a sequence of nodes 14 C V in which a user has 
shown interest, how can we find the iop-k nodes among the neigh¬ 
bors of Va that balance (1) similarity by features to the sequence of 
Vh nodes (subjective interest) and (2) uncommon compared to the 
global distribution (surprise). 

Graph exploration is an interactive and iterative process, where 
the user incrementally explore larger parts of the graph. Eacets 
solves the above problem repeatedly. 

A common approach to rank nodes is by their importance scores, 
which are often computed using PageRank pO) , Personalized PageR- 
ank (m or random walk with restart dz) 

However, there are other ways to rank the nodes, like using sur¬ 
prise or interest fT^[^. We have chosen to rank nodes by their sur¬ 
prise and user-driven interest rather than by the more conventional 
importance metrics. We chose surprise, because serendipitous re¬ 
sults and insight do not always come from the most topologically 
important nodes GD- We made Eacets adaptive, because what 
makes a nodes interesting varies from person to person and so too 
should be the criteria for ranking. Eor each node we suggest a com¬ 
bination of the most surprising and most interesting neighbors at 
each step of the journey. 






2.2 Feature Distributions 

Facets uses feature-based surprise and interest in order to guide 
the graph exploration process. Even when a dataset does not con¬ 
tain node-level features, we can derive node features by using es¬ 
tablished approaches like PageRank, centrality measures or labels 
drawn from clustering approaches. This means that even without a 
set of initial features, it’s still possible for Facets to guide graph 
exploration. 

Facets requires a compact representation of feature distribu¬ 
tions. Histograms are a natural and computationally inexpensive 
way to represent distributions. Our approach can consider any 
histogram, regardless of the binning strategy - e.g., equi-width 
or equi-height binning - used to infer the histogram. Here, we 
opt to use the parameter-free technique by Kontkanen and Mylly- 
maki GD that is based on the Minimum Description Length (MDL) 
principle. In a nutshell, it identifies as the best binning the one that 
best balances the complexity of the histogram and the likelihood of 
the data under this binning. In practice this means it automatically 
chooses both the number of and locations for the cut points that 
define the histogram depending on the complexity and size of the 
data. 

Definition 2. Representing Local Feature Distributions. We 

first create a histogram for a given feature fj and a set of nodes V 
with their feature values. A histogram consists a set of bins b E 
Bj each of which has a probability value based on the number of 
nodes corresponding to. Although we have chosen MDL binning to 
construct our histograms, FACETS will work with most histograms 
and binning approaches. 

The neighborhood (or local) distribution Lij is a distribution 
of features fj over a set of neighbors of a particular node Vi ; The 
global distribution Gj is the feature distribution across all nodes; 
and the user profile distribution Uj is the distribution of a se¬ 
quence of interesting nodes Vh collected from the user during in¬ 
teraction with Facets. 

Facets works by guiding users during their graph exploration 
using both surprisingness (Section [O] ) and subjective interest that 
changes dynamically to suit the user (Section \2A\ . We do this 
comparing the local or neighborhood feature distributions with the 
global to determine surprisingness and the local with a user profile 
to determine dynamic subjective interest. 

2.3 Ranking by Surprise 

In order to calculate a node’s surprisingness we compare the dis¬ 
tribution of the node’s neighbors with the global distribution for 
each feature. We chose a combined feature-centric and structural 
approach, because both structure and features play a critical role 
in inference problems (Tg. Nodes whose local neighborhood vary 
greatly from the global are likely to be more surprising as they do 
not follow the general global trends. 

One approach is to use the base entropy over node features to 
detect anomalous nodes; however, this ends up biasing the ranking 
towards skewed distribution. Instead we measure the difference be¬ 
tween two distributions for more consistent results. Through our 
experiments we have chosen Jensen-Shannon (JS) divergence, a 
symmetrical version of Kullback-Leibler divergence to construct 
our surprisingness metric. JS divergence works well, because the 
resulting output is in bits so the divergences of several features can 
be easily combined into a single score. We measure surprise by de¬ 
termining the divergence of feature distributions Lij over a node’s 
neighborhood Va (1 hop), from the global distributions of features 
G (see Equation!^. From these scores we select the top-k most 
surprising nodes (Equation]^. 



Figure 2: Local and global distribution histograms are both 
essential to Facets. Local histograms (orange bars) are rep¬ 
resentations of feature distributions in a single node’s egonet. 
The global distributions (gray bars) depicts the corresponding 
feature’s distribution across the whole graph. The difference 
between those two distributions refiects if a node is "unusual" 
or surprising compared to the majority in the graph. 

Given the JS Divergence or information radius between two dis¬ 
tributions P and G\ 

Djs{P\\G) = Id{P\\Q) + ^D{G\\Q), (1) 

where Q = ^(P + G) and D{P\\G) is the KL divergence for 
discrete distributions: 

/3(P||G) = ^P(6)logL| (2) 

In Equation]^ we use base 2 so that 0 < Djs{P\\G) < 1. For a 
fresh node Va, whose neighbors are not yet visualized we first com¬ 
pute the surprise-score, s^, of all neighboring nodes Vi G N{va): 

Si= ^jDjsiLijWGj), (3) 

peA 

where Lj and Gj are the local and global distributions of node¬ 
feature fj and \j is a feature weight. Weighted feature scores in 
Equation are used to lessen the impact of noisy features and to 
allow the user to lessen the contribution of a feature manually. The 
Si scores are composed into Sa, which holds all the scores for the 
neighbors of initial node Va. We can find the most surprising k- 
nodes by looking for the largest divergence from the global: 

arg max Sa (4) 

l...k 

This yields the top-k most surprising nodes among the neighbors of 
node Va. Since both the local-neighborhood feature distributions 
and the global feature distributions are static, the surprise scores 
can be precomputed to improve real time performance. We pre¬ 
compute and store surprise in FACETS to improve performance. 

2.4 Ranking by Subjective Interestingness 

We track the user’s behavior and record a user profile as they ex¬ 
plore their data. Each clicked node offers valuable details into the 
types of nodes in which the user is interested. This forms distribu¬ 
tions Uj for each feature fj . 

To rank the user’s interest in the undisplayed neighbors of node 
Va we follow a similar approach as Equation 

ri= >^iDjs{Lij\\Ui), (5) 

peA 




where Uj is the distribution of feature fj from the user’s recent 
node browsing. In this case we want the local distributions that 
match better the user’s current profile; i.e. we want the smallest 
possible divergences: 

arg min Ra (6) 

l...k 

Since this suffers from the cold-start phenomenon, because a user 
will not have a profile until they have explored some nodes, our 
remedy is to simply start the suggestions with surprising and im¬ 
portant nodes, until the user has investigated several nodes. 

3. OUR INTEGRATED APPROACH 

The visual aspects of Facets with the algorithmic rankings de¬ 
scribed previously. In this section we cover the graph visualization 
and design of our approach. 

3.1 Components 

We have created an adaptive graph exploration tool. Facets, 
for performing fast and intuitizve exploration of graph datasets. 
Facets was designed especially for graph tasks that require node¬ 
level details. 

Facets ’s user interface as shown in Figure has four key ele¬ 
ments: The first main area is the Table View (1), a conventional 
table showing the currently displayed nodes and some of their fea¬ 
tures. This provides sortable node-level information. The central 
area is the Graph View (2). It is an interactive graph 
layout that demonstrates the structure and relationships among nodes 
as the user explores. Coloring the nodes is used to encode the sur¬ 
prise and interest based on the user’s current exploration. Visual 
scalability is an enormous challenge for nodes with many neigh¬ 
bors. Because we can only show a subset of the total nodes, we 
created the Neighborhood Summary (3). The neighborhood sum¬ 
mary allows a user to investigate the feature distributions of its cur- 
rentlyundisplayed neighbors. It presents the user with feature heat 
map^ that summarize the distributions of hidden nodes. When 
clicked, the heat maps turn into conventional distribution plots (his¬ 
tograms), where a user can compare the local neighborhood (or¬ 
ange) and the global (gray). This lets a user quickly select new 
nodes based on their feature values and get a quick summary of 
this node’s neighborhood. As a user explores, we construct and 
display a summary profile of the important features they have cov¬ 
ered in the User Profile view (4). The user profile view suggests 
high-level browsing behavior to the user; allows for better under¬ 
standing of where the user-interest ranking comes from; and allows 
them to adjust if they want to ignore certain features in the interest 
ranking. 

3.2 Design Rationale 

In the following paragraphs, we discuss Facets ’s contributions 
to graph exploration. 

Exploring and Navigating 

We use the term graph navigation to refer to the act of travers¬ 
ing graph data with a known destination or objective. Graph ex¬ 
ploration is similar to navigation except that the user has little 
or no particular destination. One of our central design goals is 
to facilitate both exploration and navigation. We facilitate navi¬ 
gation through adaptation and exploration by filtering out unsur¬ 
prising and unimportant nodes while still providing crucial feature 

^While histogram encodes value of each bin as height, heat map 
uses darkness to represent values with equal height. The main ad¬ 
vantage of heat map over histogram is its compact representation, 
which helps us save space. 
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Figure 4: Facets’ s color encoding of user-interest and sur¬ 
prise using hue and saturation. The horizontal axis indicates 
the overall score of a node; nodes with high scores will be lo¬ 
cated at the right of the chart and be colored with vibrant color. 
The vertical axis shows how much more user-interest score is 
than surprise score for a node. 


details for hidden nodes via the Neighborhood Summary window. 
As shown in figure the user can have a summarized view of a 
mouse-hovered nodes where the top ranked hidden neighbors, lo¬ 
cal distribution and global distribution are displayed. This lets the 
user be able to go through many nodes quickly in seek of nodes 
with desired feature and add those desired nodes to graph view eas¬ 
ily from the top hidden node list or bin. 

Show the Best First 

Keeping the graph view from becoming a hairball means only show¬ 
ing relevant, surprising, and interesting nodes. Importance, sur¬ 
prise, and user-interest are all important aspects of discovery, so we 
blend them into the results that are shown first to the user. Figure]^ 
illustrates how we visually encode the interest-surprise difference 
by hue and the sum of both scores by saturation. Nodes ranked high 
tend to have brighter color closer to purple, which becomes a clear 
visual cue for the user to quickly identify desired nodes. Facets is 
almost completely free of parameters, making it simpler for users 
to explore their graphs. 

Adaptive and Adjustable 

Because user-interest varies greatly across users and even time, our 
design must be able to track the user’s exploration behavior in order 
to approximate what is motivating them. Adapting as the user ex¬ 
plores helps provide critical insight into users’ latent objectives, be¬ 
cause they can see how they have explored and also may find what 
they seek. During exploration, the users profile updates dynami¬ 
cally to illustrate a summary of their feature traversal, while the 
graph view provides the topological traversal. Even though there 
is no such necessarity to preset any parameters in order for our 
adaptive algorithm to work and the rankings are done in a black¬ 
box fashion during users’ exploration, we allow them to directly 
manipulate the balance of features used in the interest calculation 
as well as the choice of which aspects form the ranking. Thereby 
when deciding to focus on only some of the features or weigh some 
feature less after having done some exploration with an established 
user profile, the user has the option to do so. 

3.3 The EACETS Algorithm 

In this section, we summarize the process of finding iop-k most 
interesting and surprising neighbors in Algorithm Whenever a 
user selects a node to explore, we rank its neighbors based on sur¬ 
prise and subjective interestingness we explained in Sec. |2.3| and 
|2.4| For each of the neighbors, we compute surprise and interest 
scores for each feature and aggregate them based on feature weights 
Xj . We blend those scores, and finally, nodes with k highest scores 
will be returned. 
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Figure 3: The Facets user interface displaying an explored portion of the RottenTomatoes similar-movie graph. The user has 
traversed several films (shown as orange nodes with titles) and Facets has displayed a subset of the relevant neighbors (the nodes 
with colors ranging from blue to red) and their connectivity. 1: The Table View provides a conventional summarization of the already 
explored nodes and some of their features. 2: The Graph View shows the connectivity of similar films as the user explores (it is linked 
with the table so that a selected node is highlighted in both views. 3: The Neighborhood Summary shows the currently hidden nodes, 
by their feature distributions, for a selected film. 4: The User Profile demonstrates a heat-map or fiat histogram of the features the 
user has covered so far in their exploration. Figures are viewed best in color. 


4. EXPERIMENTS 

In this section, we evaluate the effectiveness and speed of Facets. 
Our goal with Facets is to facilitate user discovery by directing 
users towards regions of the graph with features suited to their in¬ 
terests. Our goal is exploratory search fTS) ; we are not trying to 
improve the accuracy of individual node recommendations, so it is 
not relevant to use canonical evaluation metrics like precision, re¬ 
call, MAE, and RMSE |[^|^[T^. Instead, we demonstrate the 
effectiveness of our approach with (1) three case studies that in¬ 
vestigate the results of our algorithm and (2) a comparison of our 
scoring with canonical node ranking techniques. 

4.1 Datasets 

We consider four lar^ graphs to evaluate our method. Eirst, we 
use the RottenTomatoe^ (RT) movie dataset, an attributed graph 
that contains basic information per movie (e.g., released year), as 
well as users’ average ratings and critics scores. Second, we use 
the Google Web network, the DEEP co-authorship graph, and the 
YouTube network datasets from the SNAP repository fTT). We give 
the base characteristics of these graphs in Tablej^ Eacets ignores 
nodes with zero degree, because they aren’t very useful for graph 
exploration. 


''http : // rottentomatoes . com/ 


4.2 Runtime Analysis 

Next, we evaluate the scalability of Eacets over several graphs. 
In particular, we will demonstrate its linear cost in the exploratory 
rankings. We note that the goal of this evaluation is mostly proof- 
of-concept - in this paper it is not our focus to get the lowest possi¬ 
ble ranking time, which could be achieved via better hardware and 
further engineering. 

Guided graph exploration requires sub-second rankings in order 
to remain smooth and reactive to user input. We must have rankings 
ready to display in under a second and ideally under half a second. 
This is why we have chosen to treat nodes in the tail of the degree 
distribution separately than their modest degree neighbors. 

We have analyzed the runtime of Facets, in Figureusing the 
graphs from Table all but the RT graph used eight synthetic fea¬ 
tures. In our tests we use both random ordering and contiguous 
node ordering, displayed as Rand and Hop in Figure Random 
ordering simulates using the search functionality while the hop or¬ 
dering simulates hopping from one node to its neighbors during 
exploration. During the hop tests, the choice for the next hop must 
be among connected nodes that have not yet been traversed. High 
degree nodes have a higher chance of being selected and account 
for the fact that hop sometimes is slower than random in Figure 
The graphs we tested demonstrate that the cost of the ranking is lin¬ 
ear in the number of neighbors in the neighborhood. Our ranking 
requires both a value lookup and a single IS divergence calculation 
for each node and for each feature. 












Algorithm 1: RankNeighbors 

input : node Va, precomputed histograms for: global feature 
distributions G and user profile distributions U 

output: k most interesting and surprising neighbors of node Va 

if deg(va) > 1000 then 

\_ Va ^ top 1000 neighbors of Va by highest degree 

else 

|_ Va ^ all neighbors of Va 

forall the nodes Vi in Va do 

forall the features fj in A do 

s\^'' = Djs{Lij\\Gj) 11 see Eq. ^ 

=Djs{U^i\\Ui) // see Eq. II 

tP = WasP + Wr{l - rP) 

_ Ta[i\=j:f,eA^itP 

TNodes = argmaxi...fc Ta 


Network 

Nodes 

Edges 

Rotten Tomatoes 

17 074 

72140 

DBLP 

317 080 

1 049 866 

Google Web 

875 713 

5105 039 

Youtube 

1 134 890 

2987 624 


Table 2: Graph datasets used in our speed testing. They were 
picked for their variety in size and domain. 


As mentioned in Section the surprise scores are precomputed 
and can be accessed quickly. The cost to rank neighbors comes 
largely from the interest scores which cannot be precomputed. The 
cost, in JS divergence calculations, is 0(n • /), and is asymptoti¬ 
cally linear in both the number of neighbors n and the number of 
features /. Given our use of MDL histograms for the features, we 
are able to scale the number of features at low linear incremental 
cost (see Figure]^. Each neighbor only requires exactly one JS 
divergence calculation per feature (comparing the user profile and 
the local distribution). 

Since many graphs contain triangles, it is very likely that redun¬ 
dant calls will be made during a user’s exploration. We use this 
to our advantage and cache the distributions for each visited node 
rather than refetching them each time. Graphs with higher cluster¬ 
ing coefficient may achieve better caching performance. For all but 
the Youtube graph, the caching became memoizing as the entirety 
of the nodes could fit in the cache. 

4.3 Case Studies 

Here, we describe three case studies using two graph datasets — 
the first two using the Rotten Tomatoes (RT) movie graph, and the 
last one using the DBLP co-authorship graph. These case studies 
illustrate how Facets helps the user explore and graphs, incre¬ 
mentally gain understanding, discovers new insights, and visualize 
them. Facets adapts to the user to provide surprising and interest¬ 
ing rankings to help the user explore relevant parts of the graphs. 

Movie Example I: Blade Runner 

Our first user John likes mid-90’s post-apocalyptic and action films 
(shown as gray circles in Figure [8^ e.g.. The Crow, Waterworld), 
many of which were well received by critics and audience alike. 
John can add more films that he likes by exploring node-to-node 
or by the search utility. After exploring a few films in such genre, 
John is particularly interested in Blade Runner. Based on why 
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Figure 5: Facets’s neighborhood summary view, which dis¬ 
plays the top ranked neighbors of the given node (left) and dis¬ 
tributions of the current neighborhood’s features (right). Each 
feature is displayed by a compact heat map, which can be ex¬ 
panded into a histogram. Each heat map shows both the global 
distribution (gray) and the node’s neighborhood’s distribution 
(orange). 



Figure 6: Facets ranks neighbors in linear time, which is nec¬ 
essary to handle the large numbers of nodes that a user may ex¬ 
plore. We show the average time to calculate the JS divergence 
for surprise and the combination of surprise and interest over 
a neighborhood of size n. FACETS uses the combined ranking if 
the user has explored enough to have a user profile from which 
we can extract the subjective interest ranking. We tested with 
contiguous node ordering to simulate normal exploration and 
random ordering to simulate a user searching using the table 
view. 




















Current Movies 


Example I 
Blade Runner 


Example II 
Toy Story 


Movies Visited 

Waterworld, Braveheart, Pulp Fiction, 

The Crow, Fargo 


A Bug’s Life, Kung Pu Panda, Jumanji, 

The Incredibles, How to Train Your Dragon 


Top-k by 

Title 

I 

PR 

BC 

EV 

Title 

I 

PR 

BC 

EV 


L.A. Confidential 

1.06 

2.23 

264k 

7.10 

Monsters University 

0.54 

0.76 

1070 

29.3 


Sin City 

1.15 

4.34 

833k 

44.7 

Rio 

0.63 

1.66 

25464 

57.4 

Interest 

Dredd 

1.20 

1.74 

55k 

43.3 

Toy Story II 

0.70 

1.84 

55486 

59.1 


V for Vendetta 

1.23 

2.99 

620k 

43.5 

The Iron Giant 

0.71 

1.43 

54079 

46.3 


Heat 

1.23 

4.18 

385k 

11.0 

ParaNorman 

0.71 

0.94 

23291 

15.1 

Top-fc by 

Title 

S 

PR 

BC 

EV 

Title 

S 

PR 

BC 

EV 


The Creation of the Humanoids 2.93 

0.19 

851 

0.58 

Buzz Lightyear of Star Command 

3.15 

0.14 

12 

2.14 


Demon Seed 

2.75 

0.50 

650 

1.54 

Small Fry 

3.01 

0.17 

132 

3.10 

Surprise 

Natural City 

2.69 

0.28 

437 

0.74 

Monsters University 

2.64 

0.76 

1070 

29.3 


Virtuosity 

2.64 

0.35 

668 

1.61 

Cloudy With a Chance of Meatballs 

2.46 

1.28 

1027 

42.5 


Soylent Green 

2.59 

0.57 

6578 

1.50 

ParaNorman 

2.45 

0.94 

23291 

15.1 


Table 3: Comparing Facets ’s surprise and interest ranking with common importance rankings (I&S is interest or surprise, PR is 
PageRank (xlO“^), BC is betweenness centrality, EV is eigenvector centrality (xlO“^). Each example has a selected film, a user 
profile at the time of selection, and the interesting and surprising neighbors. 
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Eigure 8: Visualizations of the (a) Blade Runner and (b) Toy Story case studies. The gray, circular nodes form a profile of films 
investigated by the user via graph exploration or text-search. The star nodes are the last-clicked node for which Eacets has produced 
the 5 most interesting (red), and surprising (blue) nodes. Nodes can be both surprising and interesting (purple). 



Number of Features 

Eigure 7: Eacets scales linearly in the number of features, 
because the ranking requires exactly one JS divergence calcu¬ 
lation per neighbor for interest and constant time lookup for 
surprise. Because of the compact structure of our MDL bins, 
the incremental cost of adding additional features is quite low. 


John has explored, Facets returns top-ranking surprising and in¬ 
teresting films. The top-5 of each kind are displayed in Tableon 
the left and in Figure [8^ These movies’ interest (I) and surpris¬ 
ingness (S) scores are determined based on conventional measures 
of node importance; PageRank (PR), betweenness centrality (BC), 
and eigenvector centrality (EV). 

Many of the surprising movies would not considered important 
by canonical approaches, partly because Facets ’s surprise rank 
operates on both features and local graph structure rather than global 
structure. Movies that are very heavily connected also face a higher 
chance of matching the global distribution and therefore being less 
surprising. 

While the notion of surprise is often considered “unimportant” 
by conventional metrics, interest exhibits more variety in impor¬ 
tance than surprise. This is especially apparent in both this and the 
next case studies. Here we have selected types of films that have 
really large viewership and represent a very large genre in modern 
film with many potentially similar movies. Both the structure and 
features used in our subjective interest have lead the ranking to¬ 
wards more conventionally important nodes (consider the high PR, 
BC, and EV scores). 

As the highest possible JS-divergence for a single feature is 1; 
the maximum divergence is the sum of feature weights A. In this 
case study, we used five features, so interest scores <1.0 suggest 

























Interesting 


that the user profile is in relatively good agreement with the pro¬ 
posed node. Nodes with very low interest divergence are strong 
candidates that their neighborhood will be of interest to the user. 

Movie Example II: Toy Story 

Our second user Bonnie investigates several children’s films that 
were criticized by the critics, but still enjoyed by audiences (all 
have a lower critics’ score than audience score). Bonnie hops from 
node-to-node across the listed films in the order they appear in Ta¬ 
ble She then selects Toy Story. The results are displayed in 
Table on the right and in Figure [8^ 

As in the previous example, the surprising nodes tend not to be 
conventionally important. Yet, consider the difference in impor¬ 
tance of the interesting films versus the previous example, many 
of these suggestions have significantly lower importance. The sec¬ 
ond profile has less coherent features and they do not draw the the 
interest ranking towards conventionally important nodes, despite 
that Toy Story is a very well connected node. Also note that Mon¬ 
sters University and ParaNorman are featured in both the top-5 
surprising and interesting movies. The surprise score and interest 
score are not counter to each other. In fact, this is an excellent 
example, because these films are both surprising and subjectively 
interesting! 

DBLP Example III 

Our third example uses data extracted from DBLP, a computer 
science bibliography website The graph is an undirected, un¬ 
weighted graph describing academic co-authorship. Nodes are au¬ 
thors. An edge connects two authors who have co-authored at least 
one paper. 

Our user Jane is a first-year graduate student new to data mining 
research. She just started reading seminal articles written by Philip 
Yu (topmost orange star in Figure]^. FACETS quickly helps Jane 
identify other prolific authors in the data mining and database com¬ 
munities, like Jiawei Han, Rakesh Agrawal, Raghu Ramakrishnan, 
and Christos Faloutsos\ these authors have similar feature distribu¬ 
tions as Philip Yu (e.g., very high degree). Jane chooses to further 
explore Christos Faloutsos's co-authors. FACETS suggests Duen 
Horng Chau as one of the surprising co-authors, who seems to have 
relatively low degree (i.e., few publications) but has published with 
highly-prolific co-authors. Among these is Brad Myers (leftmost 
orange start in Figure [^, who publishes not in data mining, but 
in human-computer interaction (HCI). This exploration introduces 
Jane to a new field, and she wants to learn more. Using Facets ’s 
interest-based suggestion, she discovers a community of co-authors 
who have published with Brad; among them, Mary Beth Rosson 
further leads to another community of HCI researchers, which in¬ 
cludes Ben Shneiderman, the visualization guru! 

5. RELATED WORK 

Graph traversal and exploration has been investigated in several 
different areas related to networks. 

Trails and Paths Through Networks 

The information retrieval community has proven an asset in ana¬ 
lyzing the web-browsing paths users covered as they explored the 
web. Click trail analysis has been used to analyze the website-to- 
website paths of millions of users in order to improve the ranking 
of search results 

Intermediate sites and destinations of common trails can be used 
successfully as search results themselves as in (H) If a user is 

"^http ://dblp .uni-trier, de 


Philip S. Yu 

■ Jiawei Han 
'A'Christos Faloutsos 

■ Jeffrey F. Naughton 
■ Rakesh Agrawal 

* Raghu Ramakrishnan 
A Duen Horng Chau 

Surprising 

if Brad A. Myers 

★dVIary Beth Rosson 

■ Scott E. Hudson! _ John M. Carroll 
■ James A. Landay \ ^ Shneiderman 

■ Gregory D. bowd * Catherine Plaisant 

“Chris North 

Interesting Interesting 

Figure 9: Visualization of our user Jane’s exploration of 
the DBLP co-authorship graph. Jane starts with Philip Yu. 
Facets then suggests Christos Faloutsos and several others as 
prolific data mining researchers. Through Christos, Facets 
suggests Duen Horng Chau as a surprising author as he has 
published with both data mining and human-computer inter¬ 
action (HCI) researchers, like Brad Myers. Through Brad, 
Facets helps Jane discover communities of HCI researchers, 
including Ben Shneiderman, the visualization guru. 


following a common trajectory they can even be “teleported” to 
the destination page f26\ . However, in our case we do not have 
millions of explored paths through our input network and cannot 
directly rely on the aggregate analysis of trails used above. 

West et al. analyzed Wikipedia users’ abilities and common pat¬ 
terns as they explored Wikipedia p0| . They observed that users 
would balance between a conceptually simple solution at the cost 
of efficiency-there were more direct routes to their target article 
that made less sense than they path they chose. Their system also 
used trail analysis in order to try to predict where a user would go 
based on the user’s article-trail features. 

Degree of Interest 

The visualization community has also investigated local graph ex¬ 
ploration. Bottom-up exploration first appeared in p^ , a tool for 
exploring hierarchies using a “degree of interest” (DOI) function to 
rank the relevance of currently undisplayed nodes. The idea of DOI 
was later expanded by to apply to a greater set of graph fea¬ 
tures. The Apolo system Q further improves on it to allow users to 
freely define their own arbitrary number of clusters, which it uses 
to determine what to show next, through the Belief Propagation al¬ 
gorithm. Recently, the DOI idea is applied to time-varying settings, 
to capture salient temporal graph changes Q. 

We have built on the idea of using a DOI to determine the ranking 
for what nodes we show users; however, we use a dynamic DOI 
function which changes to suit the browsing behavior of the users 
as they explore their data. 

Surprise and Serendipity 

Algorithms like Oddball ||2|, an unsupervised approach to detect 
anomalies in weighted graphs, can be used to detect surprising 
nodes. Akoglu et al. focused on anomalies based on edge weight- 




ing and not on node-level features. The TANGENT algorithm by 
Onuma et al. is a parameter free technique used to discover sur¬ 
prising recommendations (Tg. They measure the surprise in their 
model by measuring the amount of horizon-broadening each new 
node imparts, where the horizons are edges going to different clus¬ 
ters of users and films. 

Similar to our approach to comparing local and global distribu¬ 
tions, Parameswaran et al. presented the idea of finding inter¬ 
esting dimensions in the context of data cube by comparing their 
distributions to those aggregated. 

Andre et al. studied the potential for using serendipity in Web 
search and the effects of personalization on that potential. They 
found out that serendipity could be useful for certain queries, and 
personalization, ideas which we leverage in Facets. 

De Bie proposed a general framework for measuring the 
interestingness of data mining results as their log-likelihood given a 
Maximum Entropy (MaxEnt) distribution based on the background 
knowledge of the user. Instead, we consider the Jensen-Shannon 
divergence between the local and global histograms as the surpris¬ 
ingness of a node. Our notion of subjective interestingness comes 
closer to that of Tatti and Vreeken p5) , whom proposed to itera¬ 
tively update the model, to avoid redundancy. Our goal is orthog¬ 
onal, as we are specifically interested in identifying nodes that are 
similar to those the user chose to explore. 

6. CONCLUSION 

In this work, we presented Facets, an integrated approach that 
combines visualization and computational techniques to help the 
user performs adaptive exploration of large graphs. Facets over¬ 
comes many issues commonly encountered when visualizing large 
graphs. Facets shows the user only the most subjectively interest¬ 
ing material as they explore. We do this by ranking the neighbors of 
each node by surprisingness and subjective interest based on what 
the user has explored so far. The surprisingness rankings are de¬ 
termined by the divergence of local feature distributions from the 
background feature distributions. The subjective interest rankings 
arise from the divergence between the local feature distributions 
and the user’s current profile, which represents their exploration, in 
node-features, up to that point. 

Our Facets algorithm is scalable and is linear in the number of 
neighbors and linear in the number of features. We demonstrated 
the effectiveness of Facets through case studies using the Rot- 
tenTomatoes movie graph and the DBLP co-authorship graph, and 
comparison with canonical importance ranking measures. The user 
interest ranking influences the direction of a user’s exploration by 
showing the best matches to their current taste in nodes. Despite 
the old adage, you can see the graph through its nodes. 
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